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TITLE 


OF 


INVENTION: THERMAL TOLERANT MANNANASE FROM ACIDO' 
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CELLULOLYTICUS 
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<130> 


FILE REFERENCE: 40197 
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<140> 


CURRENT APPLICATION NUMBER: 09/917,378 
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CURRENT FILING DATE: 2 001- 


-07-28 
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NUMBER OF 
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SOFTWARE : 
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<210> 


SEQ ID NO 
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<211> 


LENGTH: 762 
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<212> 


TYPE : 


PRT 


























24 


<213> 


ORGANISM: 


Acidothermus cellulolyticus 












26 


<400> 


SEQUENCE : 


1 
























27 


Met 


Gly 


Leu 


Val 


Arg 


Arg 


Pro 


Ala 


Arg 


Ala 


Phe 


Val 


Ala 


Thr 


Ala 


Ala 


28 
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15 
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Gly 


Thr 


Ala 


Val 


Ala 


Ala 


Ala 


Ala 
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Gly 


Ser 
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Met 


Pro 
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Ser 


Ala 


Thr 


Ala 


Ala 


Pro 


Ala 


Gly 


Phe 


Val 


Thr 


Ala 


Ser 


Gly 


Gly 


Gin 


34 








35 
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45 








36 


Phe 


Val 


Leu 
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Gly 
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Tyr 


Arg 


Tyr 


Gly 
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Thr 
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Asn 


Tyr 
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55 
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Asp 


Val 


Asp 
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Val 
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139 
141 
142 
144 



148 
150 
151 



595 600 
Ser Pro Ser Pro Thr Pro Ser Pro Ser Pro Thr 

610 615 
Ser Pro Thr Pro Ser Pro Ser Ser Ser Pro Ser 
145 625 630 635 

147 Ser Pro Ser Pro Thr Pro Ser Pro Ser Pro Ser 

645 650 
Ser Val Ser Ser Ser Gly Val Gly Cys Arg Ala 
660 665 

153 Ser Asp Trp Gly Ser Gly Phe Thr Ala Thr Val 

154 675 680 

156 Gly Ser Arg Ala Thr Ser Gly Trp Thr Val Ala 

157 690 695 

159 Asn Gin Thr Val. Thr Asn Tyr Trp Asn Thr Ala 

160 705 710 715 

162 Ala Ser Val Thr Ala Thr Asn Leu Ser Tyr Asn 

163 725 730 

16 5 Gly Gin Ser Thr Thr Phe Gly Phe Asn Gly Ser 



Pro 
620 
Pro 

Pro 

Thr 

Thr 

Trp 
700 
Leu 

Asn 

Tyr 



605 
Ser 

Ser 

Ser 

Tyr 

Val 
685 
Ser 

Thr 

Val 

Ser 



166 



740 



745 



Pro Ser Pro 

Pro Ser Pro 
640 

Pro Ser Pro 

655 
Val Val Asn 
670 

Thr Asn Thr 

Phe Gly Gly 

Gin Ser Gly 
720 

lie Gin Pro 

735 
Gly Thr Asn 
750 



168 Thr Ala Pro Thr Leu Thr Cys Thr Ala Ser 



169 



760 



755 

172 <210> SEQ ID NO: 2 

173 <211> LENGTH: 2289 

174 <212> TYPE: DNA 

175 <213> ORGANISM: Acidothermus cellulolyticus 

177 <400> SEQUENCE: 2 

178 atgggtctag tgcgtcgccc tgcgcgagca tttgttgcga 

179 gctgccgcgg cgacgctcgg ctcaatcacc atgccgtcag 

180 ttcgtcaccg catccggcgg tcagttcgtt ctgaacggcc 

181 acgaacaact attacctcag ctatcagtcg cacgccgacg 

182 gctcaagcga tgaatctttc tgtcatccgg acctggggtt 

183 gacggctccg tgcccacaat cgatggcaac aagaacggct 

184 ccgtcgaccg gcgctccggc gtacaacgac gggccgaccg 

185 gcgatcgcga gcgcggccgc gcacggcctt cgggtgattg 

186 aaagaatttg ggggaatgga tcaatacgac aagtggtacg 

187 ttctacaccg acccccggac ccagcaggcg tacaagaatt 

188 cgggtcaaca gcattaccgg cgtgacgtac aagaacgatc 

189 cttgccaatg agccgcgctg cgtaggaagc ggcacattac 

190 caggcgacca ttgtcaactg ggtcgatcaa atgtcggcgt 

191 aaccatatgg tctcggtcgg cgacgaaggg ttctacattg 

192 tggccataca acgacccgtc cgacggcgtc gacaacaatg 
19 3 attgactttg gcacgtatca cctgtacccg aattactggg 

194 acgcaatgga tcaaggatca tattgcgaat gccgcagcga 

195 gaagaattcg gctggcagac accggaccgc gattccgtct 

196 gtgcgtacga acggtgaagc aggctggaac ttctggatgc 

197 cagccatatc cgaactatga cggcttcaac gtctactacc 

198 ctcgccagcg aggcgctcgc aatcagtacc ggcacatcgc 

199 agtccatcct cgtcgccgtc tccgtcgccg tctccgtcgg 

200 tcggcgtctt cgtcgccgag cccgtctccg tcgtcgtcgc 



ccgcggccgg 
ccacggcagc 
ttccctatcg 
tcgatgacgt 
tcatcgacat 
tctactttca 
gcttgcaagg 
tcgtcctcac 
gccttcctta 
gggtcaatca 
caacgatctt 
caacctcggg 
acgtcaaaag 
ggtcaacgca 
ctcttctccg 
gccagaacgc 
tcggcaagcc 
atcagacgtg 
tcgctgggaa 
caagttcaac 
ctccgccgtc 
cgtctccgtc 
cggtgtcggg 



cactgccgtt 
gccggcggga 
ttacggggga 
gttggccaag 
cggctctctt 
gtactgggac 
ccttgactac 
caacgactgg 
ccacgacaac 
tctactgaac 
tgcttgggaa 
cacgtgcact 
catagaccct 
gggaagcggc 
tgtcaagaac 
ggactgggga 
gaccattctc 
gacccagact 
tgtcaacggc 
agcgaccgtc 
gccgagctcg 
ggcgtctccg 
tggggtgaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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201 gtgcagtaca agaacaatga ttcggcgccg ggtgataacc agatcaaacc gggtctccag 1440 

2 02 ttggtgaata cggggtcgtc gtcggtggat ttgtcgacgg tgacggtgcg gtactggttc 1500 

203 acccgggatg gtgggtcgtc gacactggtg tacaactgtg actgggcggc gatggggtgt 1560 

204 gggaatatcc gcgcctcgtt cggctcggtg aacccggcga cgccgacggc ggacacctac 1620 

205 ctgcagttgt cgttcactgg tggaacgttg gccgctggtg ggtcgacggg tgagattcaa 1680 

206 aaccgggtga ataagagtga ctggtcgaac tttgatgaga ccaatgacta ctcgtatggg 1740 
2 07 acgaacaccg ccttccagga ttggacgaag gtgacggtgt atgtcaatgg ccggctggtg 1800 

208 tgggggactg aaccgtccgg caccagcccc agccccacac ccagccccag cccaaccccg 1860 

209 tccccgagcc cgagcccgac cccaagcccc agctcctccc catccccgtc cccgagcccc 1920 

210 agccccagcc ctacgccgtc cccgtcgccg agcccgtcgc cgtcgccgag tgtgtcgtcg 1980 

211 tcgggtgtgg ggtgccgggc gacgtatgtg gtgaatagtg attggggttc tgggtttacg 2040 

212 gcgacggtga cggtgacgaa taccgggagc cgggcgacga gcgggtggac ggtggcgtgg 2100 

213 tcgtttggtg ggaatcagac ggtcacgaac tactggaaca ctgcgttgac ccaatcaggt 2160 

214 gcatcggtga cggcgacgaa cctgagctac aacaacgtga tccaaccggg tcagtcgacc 2220 

215 accttcggat tcaacggaag ttactcagga acaaacaccg cacctacact cacctgcacg 2280 

216 gctagttga 2289 

219 <210> SEQ ID NO: 3 

220 <211> LENGTH: 375 

221 <212> TYPE: PRT 

222 <213> ORGANISM: Artificial Sequence 

224 <220> FEATURE: 

225 <223> OTHER INFORMATION: Description of Artificial Sequence: GH5 catalytic 

226 domain 

228 <400> SEQUENCE: 3 



229 


Ala 


Pro 


Ala 


Gly 


Phe 


Val 


Thr 


Ala 


Ser 


Gly 


Gly 


Gin 


Phe 


Val 


Leu 


Asn 


230 


1 








5 










10 










15 




232 


Gly 


Leu 


Pro 


Tyr 


Arg 


Tyr 


Gly 


Gly 


Thr 


Asn 


Asn 


Tyr 


Tyr 


Leu 


Ser 


Tyr 


233 
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25 










30 
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Gin 


Ser 


His 


Ala 


Asp 


Val 


Asp 


Asp 


Val 


Leu 


Ala 


Lys 


Ala 


Gin 


Ala 


Met 


236 






35 










40 










45 
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Asn 


Leu 


Ser 


Val 


He 


Arg 


Thr 


Trp 


Gly 


Phe 


He 


Asp 


He 


Gly 


Ser 


Leu 
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Val 
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He 


Asp 


Gly 
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Gly 


Phe 
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Gly 


Ala 


Pro 


Ala 


Tyr 


Asn 


Asp 


Gly 
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90 
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Gly 


Leu 


Gin 


Gly 
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Asp 


Tyr 


Ala 
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Ala 


Ser 


Ala 


Ala 


Ala 


His 
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105 
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Gly 


Leu 


Arg 


Val 


He 


Val 


Val 


Leu 


Thr 
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Asp 
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Lys 


Glu 
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Gly 
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Asp 
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Pro 
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Ala 
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He 
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Z 0 D 










970 
z / u 






o Q n 
z oU 


Tyr 


Pro 


Asn 


Tyr 


Trp 


vjxy 




Asn 




Asp 


irp 


vjx y 


X iiX 


r;l n 

VjX 11 


irp 


Tl 
X xc 


Z O J. 






9 7 ^ 

A f O 










9ft 0 
z o w 










2 85 








OPT 
ZOO 


Lys 


Asp 


nJLS 




nX d 


As n 


nXct 


riX d 


A 1 ;i 
nXd 


Tl 

X X 


vJXy 


jjy o 


Pro 


Thr 


He 


Leu 


0 P A 
Z 04 




290 










9 Q ^ 
Z ^ ^ 




















0 P(^ 
z 00 


Glu 


Glu 


Phe 


Gly 


Trp 


r"! n 

vjxn 


1 nx 


Pro 


Asp 






OCX 


V;^ 1 
V d X 


lyx 


f^l n 


Thr 


O P 7 


305 










jX u 










'^1 R 
J X ^ 










320 


O P Q 

z oy 


Trp 


Thr 


Gin 


Thr 


Val 


Arg 


Thr 


Asn 


Gly 


fZl n 
oX Ll 


A1 3 
n.Xd 


f^l V 

vjxy 


irp 


A Q n 
noil 


Phe 


J.X p 


T Q ri 

z y u 










325 










7 n 

J o u 










? s 

o o ^ 




TOO 

Ay A 


Met 


Leu 


Ala 


Gly 


Asn 


Val 


Asn 


Gly 


Gin 


Pro 


Tyr 


Pro 


Asn 


Tyr 


Asp 


fZl V 
vjx y 


0 Q "5 
z ^ J 








340 










345 










350 






O Q 

z y D 


Phe 


Asn 


Val 


Tyr 


Tyr 


Pro 


Ser 


Ser 


Thr 


Ala 


Thr 


Val 


Leu 


Ala 


Ser 


Glu 


0 Q 

z y o 






355 










360 










365 








O Q Q 

zy o 


Ala 


Leu 


Ala 


He 


Ser 


Thr 


Gly 




















O Q Q 

z y y 




370 










375 




















oUz 


<210> SEQ ID NO: 


: 4 
























TAT 


<211> LENGTH: 154 


























<212> TYPE: 


PRT 


























A R 

J u o 


<213> ORGANISM: 


Artificial Sequence 
















•3 n 7 

OK) f 


<2 2 0> FEATURE: 


























rip 
J u o 


<22 3> OTHER 


INFORMATION; 


: Description of 


Artificial Sequence: 


Pa T*hohvdra te 


"3 A Q 

ouy 




binding domain 






















oil 


<4 00> SEQUENCE: 


4 
























il^ 


Val 


Ser 


Gly 


Gly 


Val 


Lys 


Val 


Gin 


Tyr 


Lys 


Asn 


Asn 


Asp 


Ser 


Ala 


Pro 


olo 


1 








5 










10 










15 




ol O 


Gly 


Asp 


Asn 


Gin 


He 


Lys 


Pro 


Gly 


Leu 


Gin 


Leu 


Val 


Asn 


Thr 


Gly 


OCX 


TIC 

Jib 








20 










25 










30 






olo 


Ser 


Ser 


Val 


Asp 


Leu 


Ser 


Thr 


Val 


Thr 


Val 


Arg 


Tyr 


Trp 


Phe 


Thr 


Arg 


"3 T Q 

jiy 






35 










40 










45 








321 


Asp 


Gly 


Gly 


Ser 


Ser 


Thr 


Leu 


Val 


Tyr 


Asn 


Cys 


Asp 


Trp 


Ala 


Ala 


Met 


322 




50 










55 










60 










324 


Gly 


Cys 


Gly 


Asn 


He 


Arg 


Ala 


Ser 


Phe 


Gly 


Ser 


Val 


Asn 


Pro 


Ala 


Thr 


325 


65 










70 










75 










80 . 


327 


Pro 


Thr 


Ala 


Asp 


Thr 


Tyr 


Leu 


Gin 


Leu 


Ser 


Phe 


Thr 


Gly 


Gly 


Thr 


Leu 


328 










85 










90 










95 




330 


Ala 


Ala 


Gly 


Gly 


Ser 


Thr 


Gly 


Glu 


He 


Gin 


Asn 


Arg 


Val 


Asn 


Lys 


Ser 


331 








100 










105 










110 






333 


Asp 


Trp 


Ser 


Asn 


Phe 


Asp 


Glu 


Thr 


Asn 


Asp 


Tyr 


Ser 


Tyr 


Gly 


Thr 


Asn 


334 






115 










120 










125 








336 


Thr 


Ala 


Phe 


Gin 


Asp 


Trp 


Thr 


Lys 


Val 


Thr 


Val 


Tyr 


Val 


Asn 


Gly 


Arg 


337 




130 










135 










140 
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